Comparison of two frequency domain pitch detectors based on phonetic performance evaluation

نویسندگان

  • Helge B. D. Sørensen
  • Paul Dalsgaard
چکیده

A comparison of two pitch detection algorithms using frequency domain analysis is presented. The algorithm showing the best results based on a perfomance evaluation done by a phonetician will be selected for use in an automatic speech recogni tion (ASR-) system based on the acoustic phonetic approach using knowledge-bases with an expert system. The first algori thm is the Seneff pi tch detection algori thm, which uses purely harmonic information taken from a preselected part of the amplitude FFT-spectrum. The harmonics are picked from this spectrum and peak processing heuristics estimates the pi tch. The second algori thm is the Charpentier pi tch detection algori thm, which uses information from the ampli tude and the phase spectrum. Based on a combination of the ampli tude and the phase information the pitch is estimated. The paper will compare the resul ts from the two algori thms based on evaluation performed by a phonetician. The most reliable pi tch algorithm will be used in an ASR-system, which is based on the acoustic phonetic method. The system is running in a microcomputer environment, where phonetic feature extraction (formants, pitch etc. ) is performed in the low level of the system in real time preprocessors and used in the high level of the system, which is a dedicated knowledge-based expert system. The pi tch algori thm will be implemented in a floating-point 32 bit signal processor. Aalborg University, Speech Technology Centre, 19, Strandvejen, DK 9000 Aalborg, Denmark

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Classification of Iranian Traditional Music Dastgahs Using Features Based on Pitch Frequency

The Iranian traditional music is composed of seven majors Dastgahs: Chahargah, Homayoun, Mahour, Segah, Shour, Nava, and Rast-Panjgah. In this paper, a new algorithm for the classification of the Iranian traditional music Dastgahs based on pitch frequency is proposed. In this algorithm, the features of Lagrange coefficients of pitch logarithm (LCPL), Fuzzy similarity sets type 2 (FSST2), and th...

متن کامل

Pitch Elbow Detection

For the purpose of automatic and consistent alignment of tonal targets relative to phonetic segments we introduce one established and three new methods for automatic pitch elbow location. We further examine, whether it is beneficial to constrain the detectors to certain elbow shape types. An evaluation on hand-labeled data showed deviations from 32 to 58 ms between predicted and reference elbow...

متن کامل

A Subjective Evaluation of Pitch Detection Methods Using LPC Synthesized Speech

A subjective evaluation of seven pitch detectors has been carried out using synthetic speech. The evaluation is intended to complement the objective performance evaluation of the same pitch detection algorithms in the investigation of Rabiner et al. [1]. In the earlier study, each of the seven algorithms was evaluated on the basis of its performance with respect to four different types of error...

متن کامل

A Pitch-Catch Based Online Structural Health Monitoring of Pressure Vessels, Considering Corrosion Formation

Structural health monitoring is a developing research field which is multifunctional and can estimate the health condition of the structure by data analyzing and also can prognosticate the structural damages. Illuminating the damages by using piezoelectric sensors is one of the most effective techniques in structural health monitoring. Pressurized equipments are very important components in pro...

متن کامل

Automatic detection of prosodic prominence in continuous speech

This paper presents work in progress on the automatic detection of prosodic prominence in continuous speech. Prosodic prominence involves two different phonetic features: pitch accents, connected with fundamental frequency (F0) movements and syllable overall energy, and stress, which exhibits a strong correlation with syllable duration and high-frequency emphasis. By deriving a set of acoustic ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1987